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Abstract 

A correlation and probability analysis of the distribution of arrival directions for a 
sample of AGASA events reported to have energies above 4 x 10 19 eV shows the small 
scale clustering to remain significant at the 99.5 — 99.9% CL and to be consistent with 
previous results. For the sample taken as a whole, there are no departures from either 
homogeneity or isotropy on angular scales greater than 5 degrees. The sample of 
events with E > 6 x 10 19 eV contains no pairs with separations < 4 degrees. Cross 
correlating subsamples partitioned by energy reveals three uncorrelated distributions in 
the intervals 4 — 5 x 10 19 eV, 5 — 8 x 10 19 eV, and greater than 8 x 10 19 eV. The partition 
with 5 < E < 8 x 10 19 eV is correlated with the supergalactic equatorial plane while the 
other two groups are statistically consistent with isotropic distributions. The presence 
of three distinct energy-partitioned groups of events could reflect possible changes in 
primary composition, different source distributions, differing levels of GZK losses, or 
deflection effects of magnetic fields. 
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1 Introduction 



Investigating cosmic ray airshowers induced by extremely high energy cosmic rays 
(EHECRs, E > 4 x 10 19 eV) remains an area of intense interest because of an apparent 
paradox posed by the onset of the Greisen-Zatsepin-Kuzmin (GZK) effect at these 
energies. The GZK effect provides an efficient energy loss mechanism for sufficiently 
energetic cosmic ray protons interacting with cosmic microwave background (CMB) 
photons ^ [2] • Specifically, for cosmic ray primaries of energy E > 4 x 10 19 eV, 
photopion production is greatly enhanced through the excitation of the A + (1232 MeV) 
resonance. As the mean free path for this process is ~ few Mpc, it is virtually impossible 
for a proton of initial energy ~ 10 20 eV to travel distances greater than 50 — 100 Mpc 
without losing a large fraction of its energy 0-0- This distance forms the boundary 
of the so-called "GZK sphere" . As is well known, the apparent paradox is that several 
cosmic ray experiments have detected a total of approximately 15 events apparently 
having E > 10 20 eV but with no currently obvious astrophysical source within the local 
GZK sphere. 

Due to the small data sample presently available, relatively little is known with cer- 
tainty concerning these trans-GZK events. In particular, unresolved are the funda- 
mental questions concerning the composition/charge of the primaries and how the 
distribution of arrival directions is related to the spatial distribution of sources. Since 
there is no quantity analogous to galactic redshift to indicate relative distances, statis- 
tical analysis of the arrival directions of cosmic ray induced airshowers is necessarily 
restricted to two dimensional tests over angular areas. And while the angular distri- 
bution is obviously the projection of an underlying spatial distribution, the danger 
of inferring spatial properties from angular characteristics is well known due to the 
impossibility of deconvolving independent projection effects. 

What have been called "small scale anisotropics" as manifested by the apparently non- 
random clustering of EHECR arrival directions in multiplets (doublets and triplets) 
may or may not be correlated with 3-dimensional inhomogeneities. Small scale an- 
gular anisotropies in the distribution of galaxies are associated with localized spatial 
inhomogeneities, and this is similarly true for EHECRs if they originate from sources 
embedded in a matter distribution (luminous or dark). While it is reasonable to spec- 
ulate this is the case, there is as yet no concrete evidence to support even this basic 
hypothesis. At this time, an equally viable hypothesis is to associate the clustering 
with magnetic focusing in which case there is no correlation with a 3-dimensional den- 
sity distribution. It is also possible that the observed distribution of arrival directions 
results from both (or other) causes. 

For this study, we analyze the AGASA data for airshowers with zenith angles < 45° 
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contained in references [7j and jS] plus three additional events with E > 10 20 eV listed 
on the AGASA website jHj- Inclusion of the three newest high energy events listed on 
the website is desirable in order to increase the statistics of the tests used here, but is 
admittedly not optimal since it is currently not possible to include the additional events 
between 4 x 10 19 and 1 x 10 20 eV that will be part of the next AGASA update. Where 
a bias is possible due to this selective inclusion of events, the sample is restricted to 
only the 57 events contained in reference [S] . Although it forms a doublet with a higher 
energy event and is close to the threshold cutoff of 4 x 10 19 eV, we specifically exclude 
the event with E = 3.89 x 10 19 eV because it is impossible to assess the significance 
of this additional pairing without having the positions of all other events of energies 
3 — 4 x 10 19 eV. In addition, although there are data for Haverah Park, Volcano Ranch, 
and Yakutsk events above 4 x 10 19 eV, for the most part, we choose not to include them 
here since a large part of our analysis involves energy cuts, and it remains uncertain 
as to how closely the energy estimates match between the various experiments. This 
limits the significance of the results, but it is probably best to be conservative until, 
at the least, revised energies are published from the re-analysis of the Haverah Park 
data [TO] . 

Previous investigations of AGASA data have found statistically significant clustering on 
small scales with no apparent clustering or anisotropy on large scales; a representative 
sample of model- independent studies are references and [H]-[II|. For the present 
analysis, a robust angular correlation estimator is applied to the AGASA EHECR 
arrival directions to probe for possible departures from homogeneity on all angular 
scales. The grouping of EHECR arrival directions into doublets and triplets is re- 
analyzed with the Monte Carlo simulations used here yielding larger probabilities for 
the presence of small scale clusters but with similar significance compared to previously 
reported results. A cross correlation estimator is then applied to energy-partitioned 
subsamples of EHECRs with the result that for the AGASA sample there appear to be 
three distinct (uncorrelated) distributions with one being preferentially located near 
the supergalactic equatorial plane. 



2 The Two Point Angular Correlation Function 

The two point correlation function is an effective and commonly used statistic to mea- 
sure departures from homogeneity of an observed distribution of points. With no 
distance information available for cosmic ray events, the starting point is the two point 
angular correlation function representing the projection of the spatial correlation func- 
tion on the celestial sphere. It is defined in terms of the joint probability 5P for finding 
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two points separated by an angle 8, 



5P = n 2 [1 + w(6)] 5^ 1 5n 2 , (1) 

where 5Q± and 5^2 are solid angle elements containing two points separated by angle 9 
and n is the mean surface density of points; w(8) = for a homogeneous (uncorrelated) 
distribution of points (for a detailed development of correlation functions see ) • As 
a practical matter, w{9) is computed from the ratio of counts of pair separations for 
the data sample, DD, and the average pair separations from a Monte Carlo generated 
test model, RR, 

W{9) = ^- 1 - (2) 
Previous investigations utilizing this estimator for analyzing cosmic ray arrival direc- 
tions include references [THj and ^7]. However, as shown by Landy and Szalay, the 
estimator in (J2J) generally exhibits variances larger than the values expected when the 
pair counts follow a Poisson distribution ^H]. They analytically developed a modifi- 
cation that possesses a variance naturally free of a higher order term present in the 
variance of the estimator in (J2J). In addition, the Landy-Szalay (LS) form for w(8) is 
robust in reducing spurious correlations resulting from inaccuracies in the assumed test 
model. The LS form for w{6) is 

where it can be seen that the data/random cross-term in Q is responsible for the 
self-correcting and stabilizing feature that reduces both the effects of modeling er- 
rors (including boundary effects) as well as the spurious correlations due to large data 
fluctuations intrinsic to small samples. When boundary effects are small, the data 
sample sufficiently large, and the test model faithfully reproduces the actual average 
background distribution, the results of using equation ((2J) are accurate and agree with 
If the test model is approximately correct, highly significant features will appear 
the same regardless of whether (J2J) or (jHJ) is utilized. Here it is safest to use the LS 
form for w{6) since the data sample of AGASA EHECRs is still relatively small and 
the mean density of the underlying source distribution is unknown. 

The Monte Carlo sampled distribution of EHECR arrival directions is constructed to be 
uniform in right ascension and follows the observed declination distribution for AGASA 
events with zenith angles < 45° as presented in The test model therefore corre- 
sponds to a both isotropic and homogeneous source distribution convolved with the 
sky coverage and detector acceptance of the AGASA experiment. Detailed examina- 
tion shows that the Monte Carlo random catalogs accurately reproduce the polynomial 
fit to the declination distribution over the entire interval of —10° < 5 < 80°. 
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(a) E > 4x10 19 eV 



(b) E > 5x10 19 eV 
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Figure 1: The two point correlation function for a sample of 60 AG AS A events subject to 
energy cuts for a bin size of 2.5 degrees. The two bins marked by the arrow in (d) combine 
to form a 3a point for a bin size of 3°. 



The two point angular correlation function is shown Figure [I]for a bin size of 2.5° and 
for various energy cuts. The small angle clustering in Figures ^a,b) is apparent from 
the > 4o" positive correlation in the first bin (with MC probability of ~ 0.1%). As 
shown, the plots extend only to pair separations of 65°, but there are no significant 
correlations at larger separations for any of the samples. For a bin width of 3°, the 
values of w{6) for the first bin are slightly reduced but with similar significance to those 
with the 2.5° bin size. The positive correlation in the first bin drops very rapidly and 
disappears when the bin width reaches 4° to 5°. The null correlations for bin sizes of 
4° and 5° approximate the results obtained from perturbing the positions around their 
stated values with a Gaussian error distribution. The lack of any statistically signif- 
icant structure at the larger angles demonstrates that when the sample is considered 
in its entirety, the AGASA EHECR distribution of arrival directions is statistically 
homogeneous on all angular scales except for those < 4° . 
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A further comparison of Figures ^a,b) shows that while there are three additional 
pairs in the entire sample compared to the sample cut at E > 5 x 10 19 eV (7 versus 
4), the value of w(8) and the significance of the small scale clustering is somewhat 
greater in the latter sample due to the smaller total number of events (35 versus 60). 
The change in significance is related to the fact that, for example, the probability for 
having zero pairs in a random sample is less for the larger sample (~ 20% for N = 60 
and ~ 60% for N = 35). This result serves as a reminder that while the clustering seen 
in the AGASA dataset is statistically significant, it is quite possible that some of the 
pairing in the sample of 60 events is due to chance projection. Also of interest is the 
lack of clustering of events with E > 6 x 10 19 eV for the AGASA sample analyzed here 
(0 pairs with separations < 4 degrees, 1 pair with separation < 5 degrees). However, 
with only 25 events in this subsample, it is not possible to form any conclusions relative 
to this result. 

The two bins highlighted by the arrow in Figure ^d) combine to form a 3<r result in a 
3° bin at ~ 60° for events with E > 8 x 10 19 eV. As discussed previously, one of the 
strengths of the LS estimator is that similar features appearing in results obtained from 
using the estimator in (J2J) are naturally suppressed by the cross term in (|H|). We have 
checked that this feature is not the result of an average density anisotropy between 
different parts of the sky where the highest energy events are located, but is due to 
the formation on the celestial sphere of two essentially equilateral triangles with event 
arrival directions located at the vertices. Shifting the bin center or increasing the bin 
size to 4° strongly suppresses the correlation. Inclusion of the three newest AGASA 
high energy events has no effect on the significance of the point, but removal of the 
three events between 9 — 10 x 10 19 eV reduces the correlation to insignificance. While 
the Monte Carlo probability for such an occurrence is only ~ 1%, it is suspected that 
this feature is just an unusual chance alignment in the data. Thus, although it seems 
unlikely this is a true correlation indicating structure, it is worth noting for further 
study as larger event samples become available. 



3 Multiplet Counting and Probabilities 

Evaluating the true significance of the observed clustering requires additional consid- 
eration of the probability of occurrence. The two point correlation function involves 
pair counts regardless of whether the pairs occur in doublets, triplets, quadruplets, 
etc. However, since the probability of random occurrence for three doublets can be 
very different than for one triplet, it is important to analytically or numerically de- 
termine probabilities for various multiplet configurations. This implies that the first 
bin of the autocorrelation function may or may not represent the actual probability 
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of a given clustering configuration. This is an expected result since the two point 
correlation function is only the lowest order term in a Taylor series expansion of the 
characteristic function of the probability distribution. The presence of triplets and 
higher order multiplets requires higher order correlation functions to accurately esti- 
mate their probability of occurrence; however, the two point function is sufficient for 
exposing departures from homogeneity. 

Specific values of clustering probabilities depend on the definition of a multiplet. In 
the existing data from AGASA, Haverah Park, Yakutsk, and Volcano Ranch, there are 
two triplets, each having all pair separations less than 2.5°. Thus, in computing the 
probabilities presented here, a multiplet is defined as a set of points within a covering 
circle of diameter AO equal to the experimental uncertainty in angular position. This 
implies that the error circle of radius AO of each point in the multiplet intersects the 
estimated positions of all the other points, or, equivalently, that all n(n — l)/2 distinct 
pairs are separated by < AO. 

Based on probabilities computed from simulation, the approach used here to estimate 
the significance of small scale clustering into multiplets is similar, but not identical, 
to that employed by others (e.g., see In both the data and the Monte Carlo 

catalogs, only distinct multiplets are used in the final results. That is, our counting 
algorithm first identifies all multiplets from doublets through sextuplets for a speci- 
fied opening angle (pair separation), and then correction factors are applied to each 
count to remove the contributions of higher order multiplets on lower orders. This is 
straightforward, but sometimes unexpectedly subtle as, for example, when four points 
combine to form two distinct triplets instead of one quadruplet requiring five instead 
of six doublet pairs to be subtracted from the raw doublet count. Depending on the 
sample size simulated, between 100,000 and 500,000 Monte Carlo trials are used to 
obtain numerically converged probabilities. The random catalogs are constructed in 
the same way as for the correlation functions. Details of our Monte Carlo sampling 
routines together with a comparison of our results with are contained in reference 

Analytical estimates of clustering probabilities based on Poisson statistics have been 
presented previously in, e.g., Dubovsky et al. [12! ■ Goldberg and Weiler constructed 
analytical formulas for exact probabilities and asymptotic forms that differ naturally 
from the pure Poisson estimates due to the presence of system constraints ^H] ■ Further 
results from using the Goldberg- Weiler formulation are contained in Anchordoqui et 
al. ^1] . The method is based on calculating the probabilities of bin occupation num- 
bers for specified bin widths and numbers of bins, and yields probabilities for distinct 
occupation configurations under the assumption that all bins have equal occupation 
probabilities (an approximation to the non-uniform acceptance of ground array detec- 
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Figure 2: Joint probability contours for a sample size of 57 events for various opening angles. 
The solid circles are the multiplet configurations of the corresponding AG AS A dataset. The 
probabilities are logarithmically distributed between contours. 

tors such as AG AS A). The bin area is chosen to be ttAO 2 for a correlation (opening) 
angle of A9 (leading to a bin diameter of 2A9). 

For exclusive doublet configurations with zero higher order multiplets, the pair sepa- 
ration definition of a multiplet used in this study is equivalent to the bin occupation 
probability given by the Goldberg- Weiler formula. For a simulated uniform declination 
distribution, the exclusive doublet probabilities differed by less than f 0% from the ana- 
lytically computed values and were within 30-40% when using the AGASA declination 
distribution. Exclusive triplet probabilities from simulation were also consistent with 
the analytical results as were the joint exclusive doublet /triplet probabilities although 
differences begin to appear more significant when compared with the non-uniform 
AGASA declination distribution. In general, with an appropriate choice for the total 
solid angle used in the analytical formula, the differences between the two approaches 
can be less than a factor of two for all doublet /triplet configurations. As expected, by 
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AG = 2.5° 



AG = 3° 




Figure 3: Joint probability contours for sample sizes of 72 (solid) and 80 (dashed) with the 
solid circle being the multiplet configuration of the corresponding 72 event AG AS A dataset. 
The probabilities are logarithmically distributed between contours. 



requiring all distinct pair separations to be < A9, the quadruplet probabilities shown 
in Figure |1] are smaller by factors of 5-10 compared to those computed analytically 
where the bin diameter is 2A0. 

Joint probability contours have been constructed for various doublet /triplet combina- 
tions as a function of sample size and opening angle from the simulations utilizing the 
AGASA declination distribution. For the sample sizes considered here, the probabili- 
ties for quintuplets, sextuplets, etc. are negligible (P < 10~ 5 ) compared to the doublet 
and triplet probabilities with the quadruplet probabilities also being safely neglected 
(P < 10 -3 ) for Msampie ^ 75 when the opening angle is < 3 degrees. Examples are 
shown in Figures [21 and |3] for contours representing the joint inclusive probabilities 

P{N doub > N° doub ) n P(N trtp > N? rip ) (4) 

given specified values of N9 b and N®-. 
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Figure 4: Inclusive quadruplet probabilities P(quadruplet > 1) for the AGASA declinat 
distribution as a function of event sample size for various opening angles. 

Consideration of the contours in Figure [21 for a bin size of 2.5° shows that the observed 
configuration of 4 doublets and 1 triplet for an opening angle of A9 < 2.5° has a 
probability of less than 0.1% to occur due to random projecton. Further examination 
reveals common qualitative characteristics of clustering probabilities. For example, 
note that although the probability for finding > 2 triplets and > doublets is much 
smaller than for > 6 doublets and > triplets, there are cases where a triplet /doublet 
combination has the same or even greater probability as the same number of pairs 
distributed only in doublets. It can also be seen in this figure that just as in the 
correlation function, the significance of the probability rapidly decreases as the opening 
angle increases. The contours for the larger sample sizes in Figure may be used 
to estimate the statistical significance of doublet/triplet clustering in future AGASA 
event samples. The inclusive probabilities for quadruplets as a function of sample size 
iV < 100 for opening angles of 2.5°, 3°, 4°, and 5° are shown in Figure 

We note that the probability distributions obtained for doublets and triplets for a com- 
bination of AGASA, Haverah Park, and Yakutsk declination distributions are similar 
in shape to those obtained in reference However, a direct comparison of the prob- 
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Table 1: Summary of joint inclusive probabilities for distinct multiplet configurations of 
EHECR arrival directions from AGASA, Haverah Park, Yakutsk, and Volcano Ranch. The 
table entries referring to "low energy" are for events with E < 5 x 10 19 eV. 



Sample 


A9 


Doublets 


Triplets 


MC Probability 


N = 102, all events 


3 


8 


2 


1.0% 




4 


11 


2 


8.6% 


N = 91, exclude HP low energy 


3 


5 


2 


0.7% 




4 


7 


2 


6.5% 


N = 56, exclude all low energy 


3 





2 


0.1% 




4 


1 


2 


0.6% 



abilities found here shows them to be a factor of 2 larger (twice as likely) for the total 
number of uncorrected doublets (i.e., total number of pairs without regard to what 
order of multiplet they appear in), and a factor of 1.5 larger for uncorrected triplets 
for the 92 events considered. The results of Uchihori et al. indicated that the small 
angle clustering for the entire sample of 92 events was only marginally significant for 
an opening angle of 3° and not significant for larger opening angles jllj . 

Although our probabilities for total (uncorrected) pair counts are greater, when con- 
sidering the corrected joint doublet/triplet probabilities for distinct multiplets only, 
we obtain Monte Carlo probabilities numerically similar to those in Values of the 
same order are obtained for the extended sample of 102 events with the results summa- 
rized in Tabled From this table it can be seen that as in the sample of AGASA events 
only, the amount of clusters decreases but the significance of the clustering increases 
as lower energy events are removed from consideration. For example, for A9 = 4°, 
ignoring the likely differences in energy scales between the experiments and arbitrarily 
removing all events with reported energies < 5 x 10 19 eV eliminates all but 1 doublet 
but leaves the 2 triplets with a resulting MC probability of 0.6%. Thus, it is possible 
that the small angle clustering is more significant in the combined sample than previ- 
ously thought; however, it must be emphasized that this cannot be confirmed until the 
revised energies are given for the Haverah Park events. 
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4 Cross Correlations of Energy Partitions 



The two point correlation function in Section 2 is the autocorrelation function for a 
sample of data points. It is often of interest to compute the cross correlation of two 
data samples. In this case, the basic form analogous to © is 

^TTTT- 1 ' (5) 

where the subscripts 1 and 2 refer to the different samples. It is straightforward to 
construct an LS estimator analogous to (jHJ) that reduces to ® when D\ = D2, 

D X D 2 \{D 1 R 1 + D 1 R 2 + D 2 R l + IhBa) ^ 

( U > = ~R~R~~9 P~R + ' W 

IX\tX2 £ tX\IX2 

and where X{9) = implies the two samples are statistically uncorrelated. 



(a) Events with E > 4x1 1 9 eV (N = 60) 
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(b) Events with E > 5x1 1 9 eV (N = 35) 
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Figure 5: Cross correlation function values in the first bin for the AGASA events above and 
below various energy thresholds (bin size = 2.5°). The numbers indicate the MC probability 
of occurrence where highly correlated samples have small probabilities and vice versa. 
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Figure 6: The cross correlation function for AGASA events with 5 x 10 19 < E < 8 x 
10 19 andE > 8 x 10 19 eV showing that these two distributions are largely uncoupled at all 
angles (the 1.3a positive correlation value in the first bin is statistically insignificant). 

As a starting point, the values for the first bin of X (9) for a bin size of 2.5° are shown 
in Figure 02a) above and below a given energy threshold for the 60 AGASA events. 
As there are no events in this dataset with energies between 8 and 9 x 10 19 eV, the 
results for the energy cut at 8 x 10 19 eV are the same as for 9 x 10 19 eV. Although the 
first bin values of X{9) are positive for all cases, only those corresponding to < 1% 
probability are considered significant. Thus, the data indicates that events below 
and above 5 x 10 19 eV are not strongly correlated, and the same holds true for the 
threshold energy 8 x 10 19 eV. As shown in Figure |Sfb), after removing the 25 events 
with E < 5 x 10 19 eV, the independence of the events below and above 8 x 10 19 eV 
and the strong autocorrelation of the events with 5 < E < 8 x 10 19 eV are even 
more apparent. These results suggest the division into three groups with energies 
4<£<5x 10 19 eV, 5 < E < 8 x 10 19 eV, and E > 8 x 10 19 eV. 

Using the three energy partitions as defined above, the cross-correlation functions are 
computed for the three possible combinations. Figure |H1 shows the statistical indepen- 
dence at all angles of the middle and high energy partitions. The results for the other 
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two cases are very similar with the exception of two 3a points appearing at angular 
separations of ~ 45° and 70° in the cross-correlation of the E < 5 x 10 19 eV and the 
E > 8 x 10 19 eV partitions. For both points, the correlation values in the immediately 
adjacent bins are negative for the 2.5° bin width. This suggests that both points are 
probably unusually large fluctuations similar to the large scale feature in the 2-point 
function described in Section 2. As expected for this type of feature, the values for 
these points using @ are reduced compared to those obtained from using Q. 

Of course, more data is required to confirm the existence and independence of these 
energy partitions. If confirmed, the implications are that events separated in energy 
are being influenced differently or are of different origin or composition. This could be 
especially important in helping to determine the nature of the trans-GZK distribution 
above 8 x 10 19 eV. 



5 Anisotropy and the Supergalactic Plane 

As noted, the correlation function reveals no compelling departures from homogeneity 
on large angular scales for either the entire or the energy-partitioned samples. It is 
then natural to check whether the events appear to show directional dependences with 
respect to coordinate systems such as galactic or supergalactic. Testing for anisotropy 
begins with sector counts of data compared to expected values from simulation. This 
is sufficient for the analysis here although it should be noted that it is possible to 
describe deviations from isotropy by constructing statistical estimators such as plane 
enhancement factors. 

Figure [7| shows the distribution in supergalactic (SG) coordinates of the 60 AGASA 
events where it can be seen that the arrival directions appear to be isotropically 
distributed. However, referring to Figure |H1 constructing similar histograms for the 
three partitions described in the previous Section reveals that the group with energy 
5 < E < 8 x 10 19 eV is aligned with the SG equatorial plane at the 3a level (corre- 
sponding to a 0.6% probability). The lower and higher energy groups are consistent 
with statistically isotropic distributions although the 2.5a deficit near the SG equa- 
tor for the low energy group should be checked in future studies. The SG longitude 
distributions for the three energy-partitioned samples cannot be distinguished from 
isotropically distributed sources. 

As an independent test of the significance of the alignment of the 5 < E < 8 x 10 19 eV 
group with the SG plane, the energies of the 60 AGASA events were randomly shuffled 
among the (fixed) event coordinates. Aftr 100,000 trials, there were an average of 
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Figure 7: The distribution of 60 AGASA events in supergalactic coordinates. The points 
marked by solid circles are the expected values obtained from the Monte Carlo simulation 
under the assumption of isotropically and uniformly distributed sources. The 1-sigma error 
bars are derived from the simulation results and are approximately Poisson. 



4.2 ±1.5 events with -10 < SGB < 10°, and in only 0.2 - 0.3% of the trials did the 
number equal or exceed the observed value of 9. This probability is of the same order 
as the 0.6% found from MC sampling the coordinates, and confirms the statistically 
significant nonrandom alignment of this partition with the SG equatorial plane. 

A possible correlation between EHECRs and the Local Supercluster was hypothesized 
by Stecker in 1968 based on his calculations of propagation paths attenuated by pho- 
tomeson production [3]. Stanev et al. noted in 1995 that EHECR arrival directions 
show a correlation with the SG plane, and that this was consistent with the hypothesis 
that powerful radio sources within the Local Supercluster, being concentrated in this 
plane, could be the production sites of high energy cosmic rays |20j . Following this, 
other studies have pointed out that some of the multiplets are preferentially aligned 
in this direction [7| 1111 121j . Because the AGASA triplet is within a few degrees of 
the SG equator, and the energies of the triplet are between 5 and 8 x 10 19 eV, the 
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(a) E > 4x10 19 eV 



(b) 4 < E < 5x10 19 eV 
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(c) 5 < E < 8x10 19 eV 
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Figure 8: The supergalactic latitude distribution of 60 AG AS A events partitioned by energy. 
The points marked by solid circles are the expected values obtained from the Monte Carlo 
simulation under the assumption of isotropically and uniformly distributed sources. The 
excess of events in (c) from —10° < SGB < 10° is at the 3a level (9 observed versus 
3.68 ± 1.74 expected) with a Monte Carlo probability of 0.6%. 



statistical significance of the results are strongly influenced by this structure. Due to 
the anticipated relative differences in energy scales, it is not possible at this time to 
further pursue the correlation of the arrival directions with the SG equatorial plane as 
a function of energy utilizing the combined data from other experiments. 

If the observed anisotropy and correlation with the SG plane of the middle energy group 
is in fact related to a source distribution residing within the Local Supercluster, then 
it is probable that this same energy group should also be anisotropically distributed 
with respect to SG longitude. Specifically, with our Galaxy being located toward the 
periphery of the Local Supercluster, there are many more galaxies along the Galaxy 
line-of-sight direction corresponding to SGL ss 0° (toward the Virgo cluster which lies 
at the center of the Local Supercluster) than along the opposite direction of SGL rs 
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180°. However, inspection of Figure 7 reveals that there is no AGASA coverage along 
SGL values where deficits would be expected to exist. Thus, determination of this 
possible anisotropy requires data from a different experiment and cannot be analyzed 
from the current dataset. 



6 Summary 

In the above analysis, we consider features with > 3a values (< 1% MC probability) 
to be potentially indicative of structure or to warrant further examination with larger 
datasets. Features at the > 2 — 2.5a level also merit further attention, but it is prema- 
ture to attribute significance to them in this study. The results may be summarized as 
follows: 

1. The AGASA EHECR arrival directions continue to exhibit clustering on scales 
< 3° at the 4a level with probabilities of random occurrence of ~ .1 — .5%, and con- 
tinue to show no compelling departures from homogeneity on scales > 5°, 

2. Partitioning the AGASA sample by energy yields three uncorrelated groups with 
E <5x 10 19 eV, 5 < E < 8 x 10 19 eV, and E > (8 - 10) x 10 19 eV, 

3. The partition with 5 < E < 8 x 10 19 eV appears to be preferentially aligned with 
the SG plane with a probability for random occurrence of < 1% while the other two 
partitions are statistically consistent with isotropic distributions. Supporting evidence 
that this anisotropy is related to the Local Supercluster would be an observation of 
an anisotropy in the numbers of events toward SG longitudes in the neighborhoods of 
0° and 180°, but this is not yet possible due to the limited sky coverage of the AGASA 
experiment, 

4. Combining the 102 events reported to have energies > 4 x 10 19 eV from AGASA, 
Haverah Park, Yakutsk, and Volcano Ranch and considering inclusive joint probabili- 
ties of distinct multiplet configurations, we find MC probabilities of < 1% and 5 — 10% 
for opening angles of 3° and 4°, respectively. 

5. Without regard to differences in energy scales between the four experiments, 
eliminating all events in the 102 event sample with energies reportedly < 5 x 10 19 eV 
yields a clustering probability of 0.1% for the (0 doublet, 2 triplet) configuration ob- 
served for an opening angle of 3° and a 0.6% probability for the (1 doublet, 2 triplet) 
configuration observed for an opening angle of 4°. 

The change in significance of the clustering probability for the 102 event sample in 
making the energy cut and reducing the observed configuration from 11 doublets and 
2 triplets to 1 doublet and 2 triplets (A9 = 4°) underscores the unlikely chance for 
observing 2 triplets in a sample containing only ~ 50 — 60 events. 

It should also be noted that the values of the energy partitions obtained for the AGASA 
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events may or may not correspond to an absolute energy scale as it is possible that the 
AGASA energy calibration could be revised in the future. Nevertheless, the energies 
found here are consistent with certain expectations of regimes where GZK or magnetic 
field effects change their relative importance. And, of course, being based on relatively 
few events, these results can be significantly affected by the addition of the next 15 to 
20 events. 

Beyond the characterization of statistical properties, a natural goal of this type of anal- 
ysis is to determine underlying causes. With so little known concerning the nature of 
the EHECR distribution, it is tempting but difficult to extrapolate statistically signif- 
icant features of small datasets to physical properties of the parent distribution. Here 
that means attempting to associate the above results with repeating sources, changes 
in primary composition or source distribution, or magnetic lensing. The difficulty is 
compounded by the lack of knowledge of how many physical mechanisms are influenc- 
ing the observed distribution. For example, it is plausible that if there exist neutral 
EHECR primaries unaffected by GZK losses or magnetic fields, then there also exist 
charged primaries that are so affected. In that event, a single type of production mech- 
anism gives an observed spectrum that is a superposition of charged primaries from 
sources residing within the local GZK sphere, and neutral primaries that may originate 
from sources both within and beyond the GZK sphere. 

What is fair to hypothesize based on the results here is the existence of three distinct 
energy distributions (presumably of extragalactic origin) comprising the observed spec- 
trum of EHECR events and that may possess the following properties: 

1. Cosmic rays with E < 5 x 10 19 eV comprising the high energy tail of a primary 
distribution largely unaffected by GZK losses, 

2. Primaries, possibly protons, with 5 < E < 8 x 10 19 eV that have lost energy 
through the GZK effect, and, with arrival directions being aligned with the supergalac- 
tic plane, may originate from sources located within the Local Supercluster, 

3. trans-GZK primaries of unknown composition and origin with energies E > 
8 x 10 19 eV that may or may not be losing energy via the GZK effect. 

The consistency of this interpretation can be checked further once the energy scales of 
existing datasets are brought into accordance, but confirmation or rejection probably 
requires a larger single dataset as will be provided by Auger. 
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